Reinforcement Learning of Local Shape in the Game of Go
نویسندگان
چکیده
We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has proved effective in game playing programs and other reinforcement learning applications. We apply this strategy to Go by creating over a million features based on templates for small fragments of the board, and then use temporal difference learning and self-play. This method identifies hundreds of low level shapes with recognisable significance to expert Go players, and provides quantitive estimates of their values. We analyse the relative contributions to performance of templates of different types and sizes. Our results show that small, translation-invariant templates are surprisingly effective. We assess the performance of our program by playing against the Average Liberty Player and a variety of computer opponents on the 9×9Computer Go Server. Our linear evaluation function appears to outperform all other static evaluation functions that do not incorporate substantial domain knowledge.
منابع مشابه
An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic
This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...
متن کاملDevelopment of Reinforcement Learning Algorithm to Study the Capacity Withholding in Electricity Energy Markets
This paper addresses the possibility of capacity withholding by energy producers, who seek to increase the market price and their own profits. The energy market is simulated as an iterative game, where each state game corresponds to an hourly energy auction with uniform pricing mechanism. The producers are modeled as agents that interact with their environment through reinforcement learning (RL...
متن کاملImitation Learning in The Game of Go with Joseki Options
Scaling reinforcement learning methods to large, challenging decision making tasks can potentially benefit from integrating domain specific knowledge in a principled manner. This synthesis focuses on applying two forms of domain knowledge about the game of Go to improve learning performance on what continues to be an extremely challenging task. First, learning is bootstrapped by using reinforce...
متن کاملMastering the game of Go from scratch
In this report we pursue a transfer-learning inspired approach to learning to play the game of Go through pure self-play reinforcement learning. We train a policy network on a 5 ⇥ 5 Go board, and evaluate a mechanism for transferring this knowledge to a larger board size. Although our model did learn a few interesting strategies on the 5 ⇥ 5 board, it never achieved human level, and the transfe...
متن کاملApplication of Stochastic Optimal Control, Game Theory and Information Fusion for Cyber Defense Modelling
The present paper addresses an effective cyber defense model by applying information fusion based game theoretical approaches. In the present paper, we are trying to improve previous models by applying stochastic optimal control and robust optimization techniques. Jump processes are applied to model different and complex situations in cyber games. Applying jump processes we propose some m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007